Search in the Catalogues and Directories

Hits 1 – 20 of 189

1
Proceedings of the Workshop on Challenges in the Management of Large Corpora (CMLC-9) 2021. Limerick, 12 July 2021 (Online-Event) ...
Lüngen, Harald; Kupietz, Marc; Bański, Piotr. - : Leibniz-Institut für Deutsche Sprache, 2021
BASE
2
Addressing Cha(lle)nges in Long-Term Archiving of Large Corpora
Arnold, Denis [author]; Fisseni, Bernhard [author]; Kamocki, Paweł [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
3
Using Full Text Indices for Querying Spoken Language Data
Frick, Elena [author]; Schmidt, Thomas [author]; Bański, Piotr [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
4
Proceedings of the LREC 2020 Workshop, Language Resources and Evaluation Conference, 11–16 May 2020, 8th Workshop on Challenges in the Management of Large Corpora (CMLC-8)
Bański, Piotr [editor]; Barbaresi, Adrien [editor]; Clematide, Simon [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
5
Evaluating a Dependency Parser on DeReKo
Fankhauser, Peter [author]; Do, Bich-Ngoc [author]; Kupietz, Marc [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
6
The FAIR Index of CMC Corpora
In: CMC Corpora through the prism of Digital Humanities ; https://hal.archives-ouvertes.fr/hal-03121698 ; CMC Corpora through the prism of Digital Humanities, 2020 (2020)
BASE
7
Proceedings of the LREC 2020: 8th Workshop on Challenges in the Management of Large Corpora (CMLC-8)
In: Proceedings of the LREC 2020: 8th Workshop on Challenges in the Management of Large Corpora (CMLC-8). Edited by: Bański, Piotr; Barbaresi, Adrien; Clematide, Simon; Kupietz, Marc; Lüngen, Harald; Pisetta, Ines (2020). Marseille, France: European Language Resources Association. (2020)
BASE
8
What's New in EuReCo? Interoperability, Comparable Corpora, Licensing
Kupietz, Marc [author]; Margaretha, Eliza [author]; Diewald, Nils [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
9
The Vast and the Focused: On the need for domain-focused web corpora
Barbaresi, Adrien [author]; Bański, Piotr [editor]; Barbaresi, Adrien [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
10
Types and annotation of reply relations in computer-mediated communication
Lüngen, Harald [author]; Herzberg, Laura [author]. - Mannheim : Universitätsbibliothek Mannheim, 2019
DNB Subject Category Language
11
Asynchronous pipelines for processing huge corpora on medium to low resource infrastructures
Ortiz Suárez, Pedro Javier [author]; Sagot, Benoît [author]; Romary, Laurent [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
12
Proceedings of the Workshop on Challenges in the Management of Large Corpora (CMLC-7) 2019. Cardiff, 22 July 2019
Bański, Piotr [editor]; Barbaresi, Adrien [editor]; Biber, Hanno [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
13
Modelling large parallel corpora. The Zurich Parallel Corpus Collection
Graën, Johannes [author]; Kew, Tannon [author]; Shaitarova, Anastassia [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
14
Deduplication in large web corpora
Benko, Vladimír [author]; Bański, Piotr [editor]; Barbaresi, Adrien [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
15
cmc-core: a basic schema for encoding CMC corpora in TEI
Lüngen, Harald [author]; Wigham, Ciara R. [author]; Marinica, Claudia [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
16
The best of both worlds: Multi-billion word “dynamic” corpora
Lüngen, Harald [editor]; Breiteneder, Evelyn [editor]; Barbaresi, Adrien [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
17
Types and annotation of reply relations in computer-mediated communication
Herzberg, Laura [author]; Lüngen, Harald [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
18
Datenübernahmerichtlinien des Leibniz-Instituts für Deutsche Sprache [Data transfer guidelines of the Leibniz Institute for the German Language]
Schmidt, Thomas [author]; Witt, Andreas [author]; Arnold, Denis [author]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2019
DNB Subject Category Language
19
Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection
In: Graën, Johannes; Kew, Tannon; Shaitarova, Anastassia; Volk, Martin (2019). Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection. In: Challenges in the Management of Large Corpora (CMLC-7), Cardiff, Wales, 22 July 2019. (2019)
Abstract: Text corpora come in many different shapes and sizes and carry heterogeneous annotations, depending on their purpose and design. The true benefit of corpora is rooted in their annotation and the method by which this data is encoded is an important factor in their interoperability. We have accumulated a large collection of multilingual and parallel corpora and encoded it in a unified format which is compatible with a broad range of NLP tools and corpus linguistic applications. In this paper, we present our corpus collection and describe a data model and the extensions to the popular CoNLL-U format that enable us to encode it.
Keyword: 000 Computer science; 410 Linguistics; Institute of Computational Linguistics; knowledge & systems
URL: https://doi.org/10.14618/ids-pub-9020
https://www.zora.uzh.ch/id/eprint/175081/
https://doi.org/10.5167/uzh-175081
https://www.zora.uzh.ch/id/eprint/175081/1/Graen_Kew_Shaitarova_Volk_2019.pdf
BASE
20
Types and annotation of reply relations in computer-mediated communication
Lüngen, Harald; Herzberg, Laura. - : de Gruyter, 2019
BASE